Adaptive high accuracy approaches to speech activity detection in noisy and hostile audio environments
نویسندگان
چکیده
This study examines the difficult task of Speech Activity Detection (SAD) in two hostile environments: AM push-to-talk air traffic control and international telephone conversations with very low SNRs. Due to the poor performance of traditional energy-based SAD, two novel approaches to SAD were developed that specifically target spectral characteristics that typify speech, rather than trying to separate out the background, which can vary enormously. As a result these approaches are inherently adaptive to their environments. A Speech Energy Resonance Band Detection approach and a Harmonic Product Spectrum clustering approach to SAD are described in this paper and their performance evaluated against MIT Xtalk and the Teager Energy Operator (TEO) in clean and hostile environments.
منابع مشابه
Statistical Tests for Voice Activity Detection
A robust and effective voice activity detection (VAD) algorithm is proposed for improving speech recognition performance in noisy environments. The approach is based on filtering the input channel to avoid high energy noisy components and then the determination of the speech/non-speech bispectra by means of third order autocumulants. This algorithm differs from many others in the way the decisi...
متن کاملBispectra Analysis-Based VAD for Robust Speech Recognition
A robust and effective voice activity detection (VAD) algorithm is proposed for improving speech recognition performance in noisy environments. The approach is based on filtering the input channel to avoid high energy noisy components and then the determination of the speech/non-speech bispectra by means of third order autocumulants. This algorithm differs from many others in the way the decisi...
متن کاملEfficient voice activity detection algorithm using long-term spectral flatness measure
This paper proposes a novel and robust voice activity detection (VAD) algorithm utilizing long-term spectral flatness measure (LSFM) which is capable of working at 10 dB and lower signal-to-noise ratios(SNRs). This new LSFM-based VAD improves speech detection robustness in various noisy environments by employing a low-variance spectrum estimate and an adaptive threshold. The discriminative powe...
متن کاملAn Efficient VAD Based on a Hang-Over Scheme and a Likelihood Ratio Test
The emerging applications of wireless speech communication are demanding increasing levels of performance in noise adverse environments together with the design of high response rate speech processing systems. This is a serious obstacle to meet the demands of modern applications and therefore these systems often needs a noise reduction algorithm working in combination with a precise voice activ...
متن کاملAn Efficient VAD Based on a Generalized Gaussian PDF
The emerging applications of wireless speech communication are demanding increasing levels of performance in noise adverse environments together with the design of high response rate speech processing systems. This is a serious obstacle to meet the demands of modern applications and therefore these systems often needs a noise reduction algorithm working in combination with a precise voice activ...
متن کامل